Rank in Wordlist | Frequency | Word |
---|---|---|
3431 | 13 | 1,5 |
3991 | 11 | 3,. |
5293 | 8 | 1,2 |
5294 | 8 | 1,3 |
5308 | 8 | 2,2 |
5910 | 7 | 4,5 |
5917 | 7 | 7,5 |
6153 | 7 | ani,. |
6585 | 7 | seară,. |
6708 | 6 | 1,6 |
Rank in Wordlist | Frequency | Word |
---|---|---|
26840 | 1 | European(CE |
27800 | 1 | Interne(PSD |
28963 | 1 | Nationale(CNADNR |
44104 | 1 | pupa(in |
45869 | 1 | si(sau |
Rank in Wordlist | Frequency | Word |
---|---|---|
15226 | 2 | 1-0),. |
24232 | 1 | AIB),. |
24261 | 1 | ANV). |
30379 | 1 | SAJ). |
31000 | 1 | Syriza),. |
31417 | 1 | USR),. |
38009 | 1 | folosit). |
38077 | 1 | foto),. |
43339 | 1 | posibile). |
45114 | 1 | riscant)? |
Rank in Wordlist | Frequency | Word |
---|---|---|
1501 | 28 | 50% |
1560 | 27 | 10% |
3200 | 14 | 15% |
3204 | 14 | 20% |
3213 | 14 | 30% |
3978 | 11 | 16% |
3993 | 11 | 90% |
4314 | 10 | 1% |
4326 | 10 | 25% |
4329 | 10 | 70% |
Rank in Wordlist | Frequency | Word |
---|---|---|
6955 | 6 | Q&A |
8091 | 5 | RCS&RDS |
12471 | 3 | S&P |
15631 | 2 | Badea&friends |
15878 | 2 | Clapton&Steve |
16019 | 2 | D&D |
16520 | 2 | IT&C |
17656 | 2 | Standard&Poor's |
25338 | 1 | C&A |
27445 | 1 | H&M |
Rank in Wordlist | Frequency | Word |
---|---|---|
23475 | 1 | 15$ |
28037 | 1 | Ke$ha |
Rank in Wordlist | Frequency | Word |
---|---|---|
8034 | 5 | Moody's |
16841 | 2 | McDonald's |
17656 | 2 | Standard&Poor's |
17857 | 2 | Valentine's |
23778 | 1 | 27.04.'89. |
24147 | 1 | 83-'87 |
24184 | 1 | 97-'98 |
25554 | 1 | Carolina's |
25719 | 1 | Christie's |
26096 | 1 | Critic's |
Rank in Wordlist | Frequency | Word |
---|---|---|
2306 | 20 | lei/actiune |
2386 | 19 | 6/49 |
7801 | 5 | A/H1N1 |
10473 | 4 | dolari/baril |
12568 | 3 | Tecău/Lindstedt |
12569 | 3 | Tecău/Robert |
13901 | 3 | km/h |
15379 | 2 | 5/40 |
15380 | 2 | 50/2008 |
17444 | 2 | România/Suedia |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots